Data Sources for Trait Databases: Comparing the Phenomic Content of Monographs and Evolutionary Matrices
نویسندگان
چکیده
Databases of organismal traits that aggregate information from one or multiple sources can be leveraged for large-scale analyses in biology. Yet the differences among these data streams and how well they capture trait diversity have never been explored. We present the first analysis of the differences between phenotypes captured in free text of descriptive publications ('monographs') and those used in phylogenetic analyses ('matrices'). We focus our analysis on osteological phenotypes of the limbs of four extinct vertebrate taxa critical to our understanding of the fin-to-limb transition. We find that there is low overlap between the anatomical entities used in these two sources of phenotype data, indicating that phenotypes represented in matrices are not simply a subset of those found in monographic descriptions. Perhaps as expected, compared to characters found in matrices, phenotypes in monographs tend to emphasize descriptive and positional morphology, be somewhat more complex, and relate to fewer additional taxa. While based on a small set of focal taxa, these qualitative and quantitative data suggest that either source of phenotypes alone will result in incomplete knowledge of variation for a given taxon. As a broader community develops to use and expand databases characterizing organismal trait diversity, it is important to recognize the limitations of the data sources and develop strategies to more fully characterize variation both within species and across the tree of life.
منابع مشابه
MatrixConverter: Facilitating construction of phenomic character matrices1
UNLABELLED • PREMISE OF THE STUDY While numerous software packages enable scientists to evaluate molecular data and transform them for phylogenetic analyses, few such tools exist for phenomic data. We introduce MatrixConverter, a program that helps expedite and facilitate the transformation of raw phenomic character data into discrete character matrices that can be used in most evolutionary i...
متن کاملA Note on Evolutionary Rate Estimation in Bayesian Evolutionary Analysis: Focus on Pathogens
Bayesian evolutionary analysis provide a statistically sound and flexible framework for estimation of evolutionary parameters. In this method, posterior estimates of evolutionary rate (μ) are derived by combining evolutionary information in the data with researcher’s prior knowledge about the true value of μ. Nucleotide sequence samples of fast evolving pathogens that are taken at d...
متن کاملQuantifying and comparing phylogenetic evolutionary rates for shape and other high-dimensional phenotypic data.
Many questions in evolutionary biology require the quantification and comparison of rates of phenotypic evolution. Recently, phylogenetic comparative methods have been developed for comparing evolutionary rates on a phylogeny for single, univariate traits (σ(2)), and evolutionary rate matrices (R) for sets of traits treated simultaneously. However, high-dimensional traits like shape remain unde...
متن کاملRelational Databases Query Optimization using Hybrid Evolutionary Algorithm
Optimizing the database queries is one of hard research problems. Exhaustive search techniques like dynamic programming is suitable for queries with a few relations, but by increasing the number of relations in query, much use of memory and processing is needed, and the use of these methods is not suitable, so we have to use random and evolutionary methods. The use of evolutionary methods, beca...
متن کاملAn Investigation of Information Seeking Behavior of Graduate Students of Kharazmi University in Writeing Dissertation
Background and Aim: These days, Internet and online databases play a significant role in peoples’ information-seeking behavior, especially for postgraduate students. The kind of behavior seen on the part of those students who get help from online databases to write their theses might be completely different from those who search library sources to get the needed information. This study invest...
متن کامل